53 research outputs found

    Sparse LS-SVMs with L0-norm minimization

    This is an electronic version of the paper presented at the 19th European Symposium on Artificial Neural Networks, held in Bruges in 2011. Least-Squares Support Vector Machines (LS-SVMs) have been successfully applied in many classification and regression tasks. Their main drawback is the lack of sparseness of the final models, so a procedure to sparsify LS-SVMs is a frequent desideratum. In this paper, we adapt to the LS-SVM case a recent method for sparsifying classical SVM classifiers, which is based on an iterative approximation to the L0-norm. Experiments on real-world classification and regression datasets show that this adaptation achieves very sparse models without a significant loss of accuracy compared to standard LS-SVMs or SVMs.
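
    Below is a minimal sketch, in Python/NumPy, of one common iteratively reweighted scheme for approximating an L0 penalty on the LS-SVM support values. The function names, the RBF kernel and the reweighting rule lam = 1/(alpha^2 + eps) are illustrative assumptions, not the authors' exact formulation.

        import numpy as np

        def rbf_kernel(X, Z, sigma=1.0):
            d2 = ((X[:, None, :] - Z[None, :, :]) ** 2).sum(-1)
            return np.exp(-d2 / (2 * sigma ** 2))

        def sparse_lssvm_l0(X, y, gamma=10.0, sigma=1.0, eps=1e-4, iters=20, tol=1e-6):
            # Iteratively reweighted LS-SVM regression approximating an L0 penalty
            # on the support values alpha (illustrative sketch, not the paper's code).
            n = X.shape[0]
            K = rbf_kernel(X, X, sigma)
            lam = np.full(n, 1.0 / gamma)            # per-sample regularization weights
            for _ in range(iters):
                A = np.zeros((n + 1, n + 1))
                A[0, 1:] = 1.0                       # LS-SVM linear system in (b, alpha)
                A[1:, 0] = 1.0
                A[1:, 1:] = K + np.diag(lam)
                sol = np.linalg.solve(A, np.concatenate(([0.0], y)))
                b, alpha = sol[0], sol[1:]
                lam = 1.0 / (alpha ** 2 + eps)       # small alphas get penalized harder
            support = np.abs(alpha) > tol            # most alphas are driven to (near) zero
            return alpha, b, support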

    Optimized parameter search for large datasets of the regularization parameter and feature selection for ridge regression

    In this paper we propose mathematical optimizations to select the optimal regularization parameter for ridge regression using cross-validation. The resulting algorithm is suited for large datasets, and its computational cost does not depend on the size of the training set. We extend this algorithm to forward and backward feature selection, in which the optimal regularization parameter is selected for each candidate feature set. These feature selection algorithms yield solutions with a sparse weight matrix using a quadratic cost on the norm of the weights. A naive approach to optimizing the ridge regression parameter has a computational complexity that grows with the number of applied regularization parameters, the number of folds in the validation set, the number of input features and the number of data samples in the training set. The cost of our implementation is smaller than that of regression without regularization for large datasets, and it is independent of the number of applied regularization parameters and of the size of the training set. Combined with a feature selection algorithm, the complexity additionally depends on the number of selected features (forward selection) or the number of removed features (backward selection), and for large datasets it remains an order faster than the corresponding naive implementations. To show the performance and the reduction in computational cost, we apply this technique to train recurrent neural networks using the reservoir computing approach, windowed ridge regression, least-squares support vector machines (LS-SVMs) in primal space using the fixed-size LS-SVM approximation, and extreme learning machines.
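
    A minimal Python/NumPy sketch of the underlying idea that sweeping many regularization parameters can be made independent of the training-set size: the covariance matrices and a single eigendecomposition are computed once, after which each extra parameter is cheap. The function and variable names are assumed for illustration; the paper's actual algorithm additionally optimizes the cross-validation folds and the feature-selection loops.

        import numpy as np

        def ridge_sweep(X, y, lambdas):
            # X: (N, F) training inputs, y: (N,) targets, lambdas: candidate parameters.
            XtX = X.T @ X                       # F x F covariance, computed once: O(N F^2)
            Xty = X.T @ y
            evals, V = np.linalg.eigh(XtX)      # one eigendecomposition: O(F^3)
            Vty = V.T @ Xty
            # Each additional regularization parameter now costs only O(F^2),
            # independent of the number of training samples N.
            return np.stack([V @ (Vty / (evals + lam)) for lam in lambdas])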

    Inferring time-derivatives including cell growth rates using Gaussian processes

    Often the time derivative of a measured variable is of as much interest as the variable itself. For a growing population of biological cells, for example, the population's growth rate is typically more important than its size. Here we introduce a non-parametric method to infer first and second time derivatives as a function of time from time-series data. Our approach is based on Gaussian processes and applies to a wide range of data. In tests, the method is at least as accurate as others, but has several advantages: it estimates errors both in the inference and in any summary statistics, such as lag times, and allows interpolation with the corresponding error estimation. As illustrations, we infer growth rates of microbial cells, the rate of assembly of an amyloid fibril, and both the speed and acceleration of two separating spindle pole bodies. Our algorithm should thus be broadly applicable.
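
    A minimal Python/NumPy sketch of the generic construction behind such inference: because derivatives of a Gaussian process are again Gaussian processes, the posterior mean of df/dt follows from differentiating the covariance function. The RBF kernel and fixed hyperparameters below are illustrative assumptions; the paper's method also provides second derivatives, error estimates and hyperparameter inference.

        import numpy as np

        def gp_derivative(t, y, t_star, sigma_f=1.0, ell=1.0, sigma_n=0.1):
            # Posterior means of f(t*) and f'(t*) for a GP with RBF kernel
            # k(a, b) = sigma_f^2 * exp(-(a - b)^2 / (2 ell^2)) fitted to (t, y).
            def k(a, b):
                return sigma_f ** 2 * np.exp(-(a[:, None] - b[None, :]) ** 2 / (2 * ell ** 2))
            K = k(t, t) + sigma_n ** 2 * np.eye(len(t))             # noisy training covariance
            alpha = np.linalg.solve(K, y)
            Ks = k(t_star, t)                                       # cov(f(t*), f(t_i))
            dKs = -(t_star[:, None] - t[None, :]) / ell ** 2 * Ks   # d/dt* of the kernel
            return Ks @ alpha, dKs @ alpha                          # means of f and of df/dt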

    Optimized fixed-size kernel models for large data sets

    A modified active subset selection method based on quadratic Rényi entropy and a fast cross-validation procedure for fixed-size least squares support vector machines are proposed for classification and regression with an optimized tuning process. The kernel bandwidth of the entropy-based selection criterion is determined optimally according to the solve-the-equation plug-in method. In addition, a fast cross-validation method based on a simple updating scheme is developed. The combination of these two techniques is suitable for handling large-scale data sets on standard personal computers. Finally, the performance on test data and the computational time of this fixed-size method are compared to those of standard support vector machines and ν-support vector machines, resulting in sparser models with lower computational cost and comparable accuracy.
    Keywords: Kernel methods; Least squares support vector machines; Classification; Regression; Plug-in estimate; Entropy; Cross-validation
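
    A minimal Python/NumPy sketch of entropy-based active subset selection of the kind described above: candidate swaps into the working set are kept whenever they increase the quadratic Rényi entropy estimate of the subset. The random swap strategy and the fixed bandwidth are illustrative assumptions; the paper determines the bandwidth with the solve-the-equation plug-in method and combines the selection with a fast cross-validation.

        import numpy as np

        def renyi_entropy(S, h):
            # Quadratic Renyi entropy estimate of subset S via a Gaussian kernel sum
            # (normalization constants dropped; they do not affect the comparison).
            d2 = ((S[:, None, :] - S[None, :, :]) ** 2).sum(-1)
            return -np.log(np.exp(-d2 / (2 * h ** 2)).mean())

        def select_subset(X, m, h, iters=2000, seed=0):
            # Greedy swap search: replace one prototype with a random candidate point
            # and keep the swap if the entropy of the working set increases.
            rng = np.random.default_rng(seed)
            idx = rng.choice(len(X), size=m, replace=False)
            best = renyi_entropy(X[idx], h)
            for _ in range(iters):
                i, j = rng.integers(m), rng.integers(len(X))
                if j in idx:
                    continue                          # avoid duplicate prototypes
                trial = idx.copy()
                trial[i] = j
                cand = renyi_entropy(X[trial], h)
                if cand > best:
                    idx, best = trial, cand
            return idx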